Comparing compact codebooks for visual categorization
نویسندگان
چکیده
In the face of current large-scale video libraries, the practical applicability of content-based indexing algorithms is constrained by their efficiency. This paper strives for efficient large-scale video indexing by comparing various visual-based concept categorization techniques. In visual categorization, the popular codebook model has shown excellent categorization performance. The codebook model represents continuous visual features by discrete prototypes predefined in a vocabulary. The vocabulary size has a major impact on categorization efficiency, where a more compact vocabulary is more efficient. However, smaller vocabularies typically score lower on classification performance than larger vocabularies. This paper compares four approaches to achieve a compact codebook vocabulary while retaining categorization performance. For these four methods, we investigate the trade-off between codebook compactness and categorization performance. We evaluate the methods on more than 200 hours of challenging video data with as many as 101 semantic concepts. The results allow us to create a taxonomy of the four methods based on their efficiency and categorization performance.
منابع مشابه
Création de Vocabulaires Visuels Efficaces pour la Catégorisation d’Images. Creating Efficient Visual Codebooks for Image Categorization
We propose in this article an automatic method for building visual codebooks. Codebooks are obtained by quantizing local image descriptors and are used to automatically build discriminative representations of objects occuring in images. We describe an image categorization application based on the proposed approaches, providing results far above related state of the art existing methods.
متن کاملUnsupervised and Supervised Visual Codes with Restricted Boltzmann Machines
Recently, the coding of local features (e.g. SIFT) for image categorization tasks has been extensively studied. Incorporated within the Bag of Words (BoW) framework, these techniques optimize the projection of local features into the visual codebook, leading to state-of-theart performances in many benchmark datasets. In this work, we propose a novel visual codebook learning approach using the r...
متن کاملSpeeded-up and Compact Visual Codebook for Object Recognition
The well known framework in the object recognition literature uses local information extracted at several patches in images which are then clustered by a suitable clustering technique. A visual codebook maps the patch-based descriptors into a fixed-length vector in histogram space to which standard classifiers can be directly applied. Thus, the construction of a codebook is an important step wh...
متن کاملKernel Codebooks for Scene Categorization
This paper introduces a method for scene categorization by modeling ambiguity in the popular codebook approach. The codebook approach describes an image as a bag of discrete visual codewords, where the frequency distributions of these words are used for image categorization. There are two drawbacks to the traditional codebook model: codeword uncertainty and codeword plausibility. Both of these ...
متن کاملAnalysis of Visual Impacts in Compact City’s Form
Desired physical form of cities has been noticeable since the beginning of urbanization, from old patterns of early civilizations to the latest urbanism’s theories, which offered to build better cities. The opinions in recent decades have expressed that compact physical form of cities is a better form than sprawl form to achieve urban sustainability. The form of the city is the embodiment of it...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Computer Vision and Image Understanding
دوره 114 شماره
صفحات -
تاریخ انتشار 2010